Picture for Manyi Zhang

Manyi Zhang

What Makes Low-Bit Quantization-Aware Training Work for Reasoning LLMs? A Systematic Study

Add code
Jan 21, 2026
Viaarxiv icon

Benchmarking Post-Training Quantization of Large Language Models under Microscaling Floating Point Formats

Add code
Jan 14, 2026
Viaarxiv icon

The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs

Add code
Jul 10, 2025
Figure 1 for The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Figure 2 for The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Figure 3 for The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Figure 4 for The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Viaarxiv icon

L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models

Add code
May 23, 2025
Viaarxiv icon

Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models

Add code
Apr 07, 2025
Viaarxiv icon

Learning with Noisily-labeled Class-imbalanced Data

Add code
Nov 20, 2022
Figure 1 for Learning with Noisily-labeled Class-imbalanced Data
Figure 2 for Learning with Noisily-labeled Class-imbalanced Data
Figure 3 for Learning with Noisily-labeled Class-imbalanced Data
Figure 4 for Learning with Noisily-labeled Class-imbalanced Data
Viaarxiv icon

Tackling Instance-Dependent Label Noise with Dynamic Distribution Calibration

Add code
Oct 11, 2022
Figure 1 for Tackling Instance-Dependent Label Noise with Dynamic Distribution Calibration
Figure 2 for Tackling Instance-Dependent Label Noise with Dynamic Distribution Calibration
Figure 3 for Tackling Instance-Dependent Label Noise with Dynamic Distribution Calibration
Figure 4 for Tackling Instance-Dependent Label Noise with Dynamic Distribution Calibration
Viaarxiv icon